Features Selection in Water Quality Prediction in Neural Network using Canonical Correspondence Analysis (CCA)
نویسندگان
چکیده
Irrelevant inputs can cause deterioration of the network performance. This paper aims to implement features selection for water quality prediction model as the neural network generalization may be improved when the number of measured variables is reduced. Canonical Correspondence Analysis (CCA) was introduced as a feature selection method, where it choose a subset of input variables by eliminating features with little or no predictive information. Data monitoring and sampling process was carried out at 5 sampling sites of Perak river basin. The water quality data with 28 different parameters was used to determine the relative effect for each parameter to the biological oxygen demand (BOD) and chemical oxygen demand (COD) as they represent the river water quality; determination of the pollutants strength in terms of the oxygen necessity to reduce the domestic and industrial wastes, hence giving an initial value how much biodegradable waste existed in the water. While, for COD, its application is practical since it verify the amount of total oxidizable compounds in water. Results of CCA through the biplot diagram showed that suspended solid (SS), turbidity (TUR), total solid (TS), nitrate (NO3) and zinc (Zn) were most important environmental factors that influencing the BOD and COD. The study continued with a number of neural network approaches implemented for predicting BOD and COD. The results showed comparable performance of water quality prediction with CCA implementation in the network compared to ANN standalone model.
منابع مشابه
Effects of environmental factors on species diversity of rotifers using biodiversity indicators and canonical correlation analysis (CCA)
Rotifers are microscopic aquatic animals of phylum Rotifera, live in a diverse range of aquatic habitats. They are important in ecology of freshwater ecosystems by recycling nutrients and can alter trophic dynamic of planktonic communities. These features have also been used to infer environmental conditions in an aquatic habitat. Because of the important roles of this group of animals in the t...
متن کاملBenthic Macroinvertabrate distribution in Tajan River Using Canonical Correspondence Analysis
The distribution of macroinvertebrate communities from 5 sampling sites of the Tajan River were used to examine the relationship among physiochemical parameters with macroinvertebrate communities and also to assess ecological classification system as a tool for the management and conservation purposes. The amount of variation explained in macroinvertebrate taxa composition is within values r...
متن کاملDiscrimination of Power Quality Distorted Signals Based on Time-frequency Analysis and Probabilistic Neural Network
Recognition and classification of Power Quality Distorted Signals (PQDSs) in power systems is an essential duty. One of the noteworthy issues in Power Quality Analysis (PQA) is identification of distorted signals using an efficient scheme. This paper recommends a Time–Frequency Analysis (TFA), for extracting features, so-called "hybrid approach", using incorporation of Multi Resolution Analysis...
متن کاملApplication of artificial neural network (ANN) for the prediction of water treatment plant influent characteristics
Application of a reliable forecasting model for any water treatment plant (WTP) is essential in order to provide a tool for predicting influent water quality and to form a basis for controlling the operation of the process. This would minimize the operation and analysis costs, and assess the stability of WTP performances. This paper focuses on applying an artificial neural network (ANN) approac...
متن کاملThe Prediction of the Tensile Strength of Sandstones from their petrographical properties using regression analysis and artificial neural network
This study investigates the correlations among the tensile strength, mineral composition, and textural features of twenty-ninesandstones from Kouzestan province. The regression analyses as well as artificial neural network (ANN) are also applied to evaluatethe correlations. The results of simple regression analyses show no correlation between mineralogical features and tensile strength.However,...
متن کامل